Adaptation Strategies for Automated Machine Learning on Evolving Data
نویسندگان
چکیده
Automated Machine Learning (AutoML) systems have been shown to efficiently build good models for new datasets. However, it is often not clear how well they can adapt when the data evolves over time. The main goal of this study understand effect stream challenges such as concept drift on performance AutoML methods, and which adaptation strategies be employed make them more robust. To that end, we propose 6 evaluate their effectiveness different approaches. We do a variety approaches building machine learning pipelines, including those leverage Bayesian optimization, genetic programming, random search with automated stacking. These are evaluated empirically real-world synthetic streams types drift. Based analysis, ways develop sophisticated robust techniques.
منابع مشابه
Machine Learning Models for Housing Prices Forecasting using Registration Data
This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...
متن کاملAutomated Machine Learning on Big Data using Stochastic Algorithm Tuning
We introduce a means of automating machine learning (ML) for big data tasks, by performing scalable stochastic Bayesian optimisation of ML algorithm parameters and hyper-parameters. More often than not, the critical tuning of ML algorithm parameters has relied on domain expertise from experts, along with laborious handtuning, brute search or lengthy sampling runs. Against this background, Bayes...
متن کاملStrategies and Principles of Distributed Machine Learning on Big Data
The rise of Big Data has led to new demands for Machine Learning (ML) systems to learn complex models with millions to billions of parameters, that promise adequate capacity to digest massive datasets and offer powerful predictive analytics (such as high-dimensional latent features, intermediate representations, and decision functions) thereupon. In order to run ML algorithms at such scales, on...
متن کاملSingle-molecule Biophysics: Machine Learning for Automated Data Processing
Single-molecule fluorescence microscopy has been greatly successful in understanding biophysics at molecular level. This technique has been advancing toward higher throughput, which creates a need for a data-analysis tool to distinguish molecule of interest from other fluorescence signals. Here, we have used supervised machine-learning approaches to filter biological events of our interest, and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Pattern Analysis and Machine Intelligence
سال: 2021
ISSN: ['1939-3539', '2160-9292', '0162-8828']
DOI: https://doi.org/10.1109/tpami.2021.3062900